Lexicographic Semirings for Exact Automata Encoding of Sequence Models
Authors
Abstract
In this paper we introduce a novel use of the lexicographic semiring and motivate its use for speech and language processing tasks. We prove that the semiring allows for exact encoding of backoff models with epsilon transitions. This allows for off-line optimization of exact models represented as large weighted finite-state transducers in contrast to implicit (on-line) failure transition representations. We present preliminary empirical results demonstrating that, even in simple intersection scenarios amenable to the use of failure transitions, the use of the more powerful lexicographic semiring is competitive in terms of time of intersection.
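As a rough illustration of why a lexicographic semiring can make epsilon-encoded backoff exact, the sketch below builds a pair-of-tropical-weights semiring. It is not the paper's construction. The class name `LexWeight`, the helper `path_weight`, and all numeric weights are assumptions chosen only for this example. The "plus" operation takes the lexicographic minimum of two pairs, and the "times" operation adds them componentwise.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class LexWeight:
    """Hypothetical pair of tropical weights, compared lexicographically."""
    first: float   # dominant component (here: a penalty for backing off)
    second: float  # secondary component (here: a negative log probability)

    def plus(self, other: "LexWeight") -> "LexWeight":
        # Semiring "plus": keep the lexicographically smaller pair.
        if (self.first, self.second) <= (other.first, other.second):
            return self
        return other

    def times(self, other: "LexWeight") -> "LexWeight":
        # Semiring "times": componentwise tropical product, i.e. addition.
        return LexWeight(self.first + other.first, self.second + other.second)


ZERO = LexWeight(math.inf, math.inf)  # identity for plus
ONE = LexWeight(0.0, 0.0)             # identity for times


def path_weight(arcs):
    """Combine the arc weights along one path with the semiring product."""
    total = ONE
    for weight in arcs:
        total = total.times(weight)
    return total


# Illustrative numbers only: a direct n-gram arc versus a path that first
# takes an epsilon backoff arc carrying a positive first component.
direct = path_weight([LexWeight(0.0, 5.0)])
backoff = path_weight([LexWeight(1.0, 0.5), LexWeight(0.0, 2.0)])

# The direct path wins on the first component, although the backoff path
# has the smaller second component (2.5 versus 5.0).
best = direct.plus(backoff)
```

With a single tropical weight the cheaper backoff path would be selected, which is the inexactness that plain epsilon transitions introduce. Putting the backoff penalty in the dominant component means the backoff path is chosen only when no direct arc exists.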
Similar resources
Applications of Lexicographic Semirings to Problems in Speech and Language Processing
This paper explores lexicographic semirings and their application to problems in speech and language processing. Specifically, we present two instantiations of binary lexicographic semirings, one involving a pair of tropical weights, and the other a tropical weight paired with a novel string semiring we term the categorial semiring. The first of these is used to yield an exact encoding of backo...
Synchronous composition of interval weighted automata using tensor algebra of product semirings
Interval weighted automata are introduced as automata with weights in a product dioid (idempotent semiring). They constitute an extension of (max,+) automata since they enable us to model temporal constraints (instead of exact durations) for transitions. Their synchronous composition, that coincides with the synchronous product of underlying (one clock) timed automata, results in multi-event in...
Interprocedural Dataflow Analysis over Weight Domains with Infinite Descending Chains
We study generalized fixed-point equations over idempotent semirings and provide an efficient algorithm for the detection whether a sequence of Kleene’s iterations stabilizes after a finite number of steps. Previously known approaches considered only bounded semirings where there are no infinite descending chains. The main novelty of our work is that we deal with semirings without the boundedne...
On Possibilistic Automata
General automata are considered with respect to normalization over semirings. Possibilistic automata are defined as normal pessimistic fuzzy automata. Possibilistic automata are analogous to stochastic automata where stochastic (+/×) semirings are replaced by possibilistic (∨/∧) semirings; but where stochastic automata must be normal, fuzzy automata may be (resulting in possibilistic automata)...
Bounds for Tree Automata with Polynomial Costs
We consider tree automata with costs over semirings in the sense of (Seidl, 1994). We define the concept of a finitely factorizing semiring and of a monotonic semiring, both as the generalization of well-known particular semirings, and show that the cost-finiteness of tree automata with costs over finitely factorizing and monotonic semirings is decidable. We show that, for tree automata with costs over...